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Open 



Protista are ubiquitous members of soil microbial communities, but the structure of these 
communities, and the factors that influence their diversity, are poorly understood. We used 
barcoded pyrosequencing to survey comprehensively the diversity of soil protists from 40 sites 
across a broad geographic range that represent a variety of biome types, from tropical forests to 
deserts. In addition to taxa known to be dominant in soil, including Cercozoa and Ciliophora, we 
found high relative abundances of groups such as Apicomplexa and Dinophyceae that have not 
previously been recognized as being important components of soil microbial communities. Soil 
protistan communities were highly diverse, approaching the extreme diversity of their bacterial 
counterparts across the same sites. Like bacterial taxa, protistan taxa were not globally distributed, 
and the composition of these communities diverged considerably across large geographic 
distances. However, soil protistan and bacterial communities exhibit very different global-scale 
biogeographical patterns, with protistan communities strongly structured by climatic conditions 
that regulate annual soil moisture availability. 
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Introduction 

Protists are unicellular eukaryotes (Adl et al., 2005) 
and critical components of microbial communities 
in both aquatic and terrestrial environments, where 
they are integral constituents of trophic chains and 
nutrient cycles (Luxton and Petersen, 1982; Sherr 
and Sherr, 2002; Cuvelier et al, 2010; Steele et al, 
2011). As parasites and disease agents, they impact 
the health of humans, livestock and agricultural 
crops (Aurrecoechea et al., 2010). Despite their 
ecological and economic importance, and regular 
study since the time of van Leeuwenhoek, protistan 
diversity remains poorly characterized (Caron et al., 
2009). The biogeographical patterns exhibited by 
protistan communities are also poorly understood, 
and have been the subject of recent debate (Foissner, 
2006; Bass et al., 2007), with suggestions that the 
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diversity of microbial eukaryotes may show limited 
variability across large spatial scales (Finlay, 2002). 

Historically, surveys of protistan diversity have 
largely relied on morphological taxon identification; 
however, the limitations of such approaches are well 
known (Bass et al., 2007). Thus, more recent work 
has utilized DNA sequence-based methods to survey 
protists from a range of localities within oceans 
(Lopez-Garcia et al., 2001; Cuvelier, et al., 2010; 
Demir-Hilton et al., 2011; Steele et al., 2011) and 
freshwater lakes (Katz et al., 2005; Triado-Margarit 
and Casamyor, 2012). Such studies continue to 
reveal a broad spectrum of previously unrecognized 
diversity within the group (Lopez-Garcia et al., 
2001; Demir-Hilton et al., 2011; Kim et al., 2011), 
just as similar molecular approaches have revolu- 
tionized our understanding of bacterial diversity 
(Pace, 1997). However, previously published mole- 
cular studies have primarily focused on protistan 
diversity in aquatic systems. Molecular surveys of 
terrestrial protistan diversity are rare even though 
soils harbor large numbers of protists (typically 
10^-10^ active individuals g"^) that are important 
components of biogeochemical cycles (Adl and 
Gupta, 2006; Howe et al., 2009). As a result, we 
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have an inadequate understanding of the factors 
structuring soil protistan communities and their 
biogeographical patterns, nor do we know how 
protistan diversity patterns compare with those 
exhibited by soil bacterial communities, which have 
been relatively well-studied. 

Here we used high-throughput pyrosequencing of 
the 18S ribosomal RNA (rRNA) gene to conduct a 
detailed and comprehensive survey of soil protistan 
diversity. We also examined the continental-scale 
biogeographical patterns exhibited by soil protists, 
just as broad-scale molecular surveys have been 
used previously to reveal the biogeographical pat- 
terns exhibited by soil prokaryotes (Lauber et al., 
2009; Bates et al, 2011), terrestrial eukaryotic 
mesofauna (Wu et al., 2011) and marine protists 
(Katz et al, 2005; Cuvelier et al, 2010; Demir-Hilton 
et al., 2011). We collected samples across 40 broadly 
distributed sites that were selected to represent a 
wide range of soil and biome types. As each of the 
soils and sites were well-characterized and their 
bacterial communities surveyed in detail, we were 
also able to investigate the influence of edaphic and 
climatic factors on protistan community structure, 
and directly compare patterns of protistan diversity 
to those exhibited by soil bacteria. 



Methods 

Sampling, site characterization and isolation of soil 
DNA 

The 40 soil samples collected for this study include 
sites across North and South America, as well as 
Antarctica. These represented a wide variety of 
biomes and soil types from diverse climates. Fierer 
and Jackson (2006) previously described the soil 
sampling protocol, outlined briefly as follows. 
During peak growing season for vascular plants 
(with the exception of the mainly plant-free Antarc- 
tic sites), samples were collected at undisturbed 
areas from the top 5 cm of mineral soil at 5-10 
randomly selected locations within an area of 
'^lOOm^. Each collection consisted of ^lOg of soil, 
sieved to remove roots and other debris. A single 
composite soil was then prepared by combining and 
thoroughly mixing all the randomly selected sam- 
ples collected at a given locality. DNA was extracted 
from a 0.25 g subsample of the composite soil using 
the MoBio PowerSoil DNA extraction kit (MoBio 
Laboratories, Carlsbad, CA, USA) and following the 
manufacturer's instructions, with the exception of 
an additional incubation step at 65 °C for lOmin, 
followed by 2min of bead beating to limit DNA 
shearing (Lauber et al., 2009). The aim of this study 
was to examine variability of microbial communities 
across a range of geographically and climatically 
diverse sites, rather than focusing specifically on 
spatial or temporal variability within individual 
sites; thus, single, composited soil samples were 
used to represent each site. To assess variation of 
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soil protistan diversity at the local scale, some 
regions (for example, Mojave Desert) were repre- 
sented by more than one sample site. These regional 
sampling sites were separated by distances of 
approximately 200-500 m and represented localities 
with distinct vegetation cover and unique edaphic 
qualities (see Supplementary Table Si). 

Each site was quantitatively categorized into one 
of four general environment types according to the 
climate moisture index (CMI) of Willmott and 
Feddema (1992). This index reflects the local 
deficiency or surplus of moisture from annual 
precipitation (P) in relation to the annual potential 
rate of evapotranspiration (PET; Thorn thwaite, 
1948), and is calculated as follows: CMl = [F/ 
PET)-1 when P<PET or CMI = 1 - (PET/P) when 
P^PET. Values indicate sites with arid ( — 1 to 
< -0.5), semiarid (-0.5 to 0), semihumid (>0 to 
0.5) or humid (>0.5 to 1) status. Climate data were 
obtained from the National Climatic Data Center 
(http://www.ncdc.noaa.gov/). Location and environ- 
mental characteristics for each site are provided in 
Supplementary Table Si. 



Polymerase chain reaction amplification of 18S rRNA 
genes and bar-coded pyrosequencing 
Genomic DNA extracted from the samples was 
prepared for pyrosequencing according to the pro- 
tocol of Lauber et al. (2009), with the exception 
that eukaryotic-specific primers were used. This 
method includes targeted polymerase chain reaction 
amplification of ca. 600 bp of the 18S small 
subunit rRNA gene using the eukaryotic-specific 
primer set F515 (5'-GTGCCAGC]V[GCCGCGGTAA-30 
and R1119 (5'-GGTGCCCTTCCGTCA-30, followed 
by triplicate polymerase chain reaction product 
pooling (per sample) to mitigate reaction-level 
polymerase chain reaction biases, and barcoded 
pyrosequencing. This primer set has been shown 
to amplify a portion of the 18S rRNA gene from a 
wide range of eukaryotic groups with few biases 
(Bates et al., 2012), and the read length is well suited 
for community analysis (Liu et al., 2007). Protocol 
and conditions follow exactly those outlined in 
Bates et al. (2012) for polymerase chain reaction, 
amplicon pooling, as well as barcoded pyrosequen- 
cing using recently developed FLX+ technology. 
Pyrosequencing was carried out at Roche Applied 
Science (Indianapolis, IN, USA). 



Sequence processing and statistical analyses 
Raw sequence data was processed, assessed for 
quality (filtered on the basis of quality score, 
sequence length and primer mismatch thresholds) 
and analyzed using the QIIME 1.4.0 software pipe- 
line (Caporaso et al., 2010). Chimera checking and 
operational taxonomic unit (OTU) grouping were 
carried in QIIME using USE ARCH (Edgar et al., 
2011), as were taxonomic assignments of recovered 
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Figure 1 Relative abundance of soil protistan taxa (y axis as group percentage of the total number of 18S rRNA gene sequences per 
sample, after rarefaction to correct for uneven sampling effort) grouped by general CMl class. Identities of the sites sampled are given on 
the X axis (see Supplementary Table Si for specific details on site and soil characteristics). 



eukaryotic OTUs (determined at ^97% similarity) 
using BLAST (Altschul et al, 1997) against the 
SILVA comprehensive ribosomal RNA database 
(http://www.arb-silva.de/) at a sequence similarity 
threshold of 0.97 and a maximum £'-value of 10"^°. 
After taxonomies had been assigned, a data set 
comprised of only protistan (excluding fungi and 
metazoans) taxa was culled from all high-quality 
sequence reads. 

Taxonomy was determined for some protistan 
OTUs of uncertain affiliations after further BLAST 
searches in GenBank (http://www.ncbi.nlm.nih.gov/ 
genbank/). Phylogenetic distance matrices were 
generated using the UniFrac method (Luzapone 
and Knight, 2005) against a tree generated in QIIME 
from a PyNAST alignment of the SILVA sequence set 
with a dynamic entropy and gap calculation filter 
apphed (McDonald et al, 2012). QIIME was also 
used for Procrustes analysis (Caporaso et al, 2011), 
whereas principal coordinate analysis. Mantel tests, 
permutational multivariate analysis of variance and 
Bray-Curtis distance matrices generation were car- 
ried out using the PRIMER v. 6 software package 
(PRIMER-E, Plymouth, WA, USA) (Clark and Gorley, 
2006). Principal coordinate analysis used both 
unweighted (based on OTU presence or absence) 
and weighted (also considering OTU relative abun- 
dance) distance matrices (UniFrac and Bray-Curtis). 
All diversity and statistical analyses were performed 
with R statistical software (http://www.r-project.org/) 
with the aid of the packages Fields and Vegan. With 
the exception of the rarefaction curves (generated at 
a depth of 1000 sequences), analyses were carried 
out on the eukaryotic data set rarefied to 150 
sequences to correct for unequal sampling efforts. 
A bacterial data set previously generated from the 
same soil samples (Bates et al, 2011), and used here 
to compare community structure between these two 
groups of soil microbes, was also rarefied at the 
same level of sampling effort (150 sequences). For 
these rarefied data sets, richness was estimated as 
the number of unique OTUs, whereas diversity 



values were calculated for the Shannon index using 
relative abundance as the number of sequences 
representing each OTU within a given sample. 



Results and discussion 

Diversity of soil protists 

A total of 29 564 high-quality protistan sequences 
were recovered from the 40 unique soils examined, 
averaging 429 sequences per sample (ranging from 
151 to 1228). A total of 1014 OTUs (determined at 
^97% similarity) were identified across the sample 
set, including representatives from five supergroups 
and 13 major phyla: Amoebozoa (Acanthamoebidae, 
Eumycetozoa, Flabellinea, Tubulinea, Incertae sedis: 
Phalansterium) , Archaeplastida (Chloroplastida) , 
Excavata (Euglenozoa, Malawimonas, Jakobida), 
Opisthokonta (Choanomonada) and SAR (Alveolata, 
Cercozoa, Stramenoplles). 

The rarified sequence data set suggests that soils 
from our sites were overwhelmingly dominated by 
taxa in the SAR supergroup (Figure 1 and Table 1), 
with the Alveolata and Cercozoa representing 66.5% 
and 22.5%, respectively, of all of the protistan 
sequences recovered. The green algae (Archaeplas- 
tida, Chloroplastida, Chlorophyta) were also highly 
abundant in a few soils from arid areas, as were the 
golden algae (SAR, Stramenoplles, Chrysophyceae; 
Figure 1). Our finding that Cercozoa (diverse 
flagellates and amoebae within Rhizaria) and Cilio- 
phora (one of three major groups within Alveolata, 
along with dinoflagellates and Apicomplexa) are 
common members of eukaryotic soil microbial 
communities is congruent with what we know from 
previously published direct observation studies of 
soil protistan communities (Adl and Gupta, 2006; 
Chao et al, 2006). The only comparable study 
using high-throughput pyrosequencing to examine 
terrestrial protistan diversity (Urich et al, 2011, who 
examined a single site in Germany) also found 
alveolates and Rhizaria to be the dominant protists 
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Table 1 Dominant protistan taxa of soils (data have been rarefied to correct for uneven sampling effort) 
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[% of samples) 
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Eukaryotes 








OTUOOOOl 


Alveolata, Cilioptiora, Intramacronucleata, Spirotrichea 


77.5 


9.07 


OTU00003 


Alveolata, Apicomplexa, Coccidia, Eucoccidiorida 


70.0 


4.57 


OTU00033 


Rhizaria, Cercozoa, Cercomonadida, Heteromitidae 


67.5 


1.43 


OTU00014 


Rhizaria, Cercozoa, Thecofilosea, Chlamydophryidae 


65.0 


3.80 


OTU00729 


Alveolata, Ciliophora, Intramacronucleata, Spirotrichea 


65.0 


2.65 


OTU04795 


Alveolata, Apicomplexa, Coccidia, Eucoccidiorida 


57.5 


2.32 


OTU00346 


Alveolata, Apicomplexa, Coccidia, Eucoccidiorida 


55.0 


1.40 


OTU00017 


Alveolata, Ciliophora, Intramacronucleata, Spirotrichea 


52.5 


1.00 


OTU00074 


Alveolata, Ciliophora, Intramacronucleata, Colpodea 


50.0 


0.65 


OTU00046 


Stramenopiles, Chrysophyceae, Chromulinales, Chromulinaceae 


47.5 


1.95 


OTU00085 


Alveolata, Ciliophora, Intramacronucleata, Colpodea 


47.5 


1.02 


OTU00077 


Rhizaria, Cercozoa, Cercomonadida, Heteromitidae 


47.5 


0.67 


OTU03988 


Rhizaria, Cercozoa, Cercomonadida, Heteromitidae 


45.0 


0.60 


OTU00103 


Rhizaria, Cercozoa, Cercomonadida, Heteromitidae 


40.0 


0.80 


OTU04137 


Alveolata, Ciliophora, Intramacronucleata, Spirotrichea 


40.0 


0.60 



Abbreviation: OTU, operational taxonomic unit. 

Relative abundance (percentage of all IBS rRNA gene sequences) and frequency (percentage occurrence of a given OTU within all soils sampled). 
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in soil. However, as detailed below, we also found a 
number of higher-level protistan taxa that have 
rarely been reported from soil. 

The rarefaction curves (Supplementary Figure Si) 
and the high numbers of rare taxa that were present 
in these soils (70% of rarified OTUs were repre- 
sented by <5 sequences) underscore the high 
diversity of these protistan communities, even when 
a more conservative approach to OTU binning was 
used (that is, 97% as opposed to 99% similarity). 
Bacteria are among the most species-rich organisms 
in the terrestrial environment (Torsvik et al., 2002; 
Fierer and Lennon, 2011), yet we found diversity 
levels of protists to be within the same order of 
magnitude as bacteria when directly comparing 
these communities across the same soil samples at 
equivalent survey depths (Supplementary Figure 
Si). The full extent of diversity within either of 
these groups, however, remains largely unknown as 
the rarefaction curves failed to asymptote. Protistan 
and bacterial communities were also similar in their 
structure, exhibiting comparable trends for both 
rank frequency and abundance across the soils 
sampled (Supplementary Figure S2). 

Distribution patterns for soil protists 
The biogeographical patterns exhibited by protistan 
communities have been the subject of vigorous 
debate (Green and Bohannan, 2006; Caron et al., 
2009). It has been hypothesized that organisms with 
small body sizes, such as soil protists, should have 
the capacity for continuous large-scale dispersal, 
such that nearly all-possible taxa will be found 
within any sample (Finlay, 2002). Models suggest 
that microbial taxa with body sizes in the 10|im 
diameter range, such as many protists, are capable of 
global dispersal (Wilkinson et al., 2012); however, 
the viability of protistan organisms or their 
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Figure 2 Protistan community dissimilarity (unweighted Uni- 
Frac) with increasing geographic distance (best-fitted linear 
regression line). UniFrac values range from 1 (completely 
different phylogenetic assemblages) to 0 (no differences observed 
between communities). 



encysting structures after long-distance areal travel 
is not well understood (Foissner, 2006). Our results 
do not support the Finlay hypothesis of cosmopo- 
litan distribution patterns as no protistan taxa were 
found across all the sampled soils (Table 1); in the 
rarified data set, only one of the 672 OTUs was 
found in more than 75% of our soil samples and the 
majority of phylotypes (84%) were restricted to 
five or fewer individual soil samples. Although 
deeper sequencing may identify more cosmopolitan 
taxa, our results highlight that numerous protistan 
taxa exhibit some degree of endemism, as hypo 
thesized by Foissner (2006) and demonstrated 
previously for terrestrial ciliates (Chao et al., 2006). 

Distance-decay relationships for microbial com- 
munity similarity have been demonstrated 
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previously for soil fungi (Green et aL, 2004), and the 
soil protistan communities examined here 
were likewise found to be less similar, generally, 
with increasing geographic distance (Figure 2). 
Distance, however, was not a very strong predictor 
of community dissimilarity (linear regression 
R^ = 0,12, P< 0.001), which varied considerably 
within most levels of geographic separation. This 
result, along with those of other recent sequence- 
based studies characterizing the diversity of 
eukaryotic soil macro- and mesofauna (Robeson 
et aL, 2011; Wu et aL, 2011), suggests that terrestrial 
protists, and soil eukaryotes in general, are much 
more diverse than previously recognized and that 
they are not cosmopolitan in their distributions as 
had previously been hypothesized. 



Factors influencing soil protistan community structure 
It is well known that, at the continental scale, alpha 
diversity levels of macroscopic organisms frequently 
correlate with energy, water availability and latitude 
(Hawkins et aL, 2003; Davies et aL, 2008). Diversity 
of soil bacteria, on the other hand, is strongly 
correlated with soil pH levels (Lauber et aL, 2009; 
Griffiths et aL, 2011). In contrast, protistan richness 
(Supplementary Figure S3) and diversity 
(Supplementary Figure S4; Shannon index) were 
only marginally influenced by pH or any other of the 
measured environmental variables (Supplementary 
Table S2). Alpha diversity of soil protists, therefore, 
does not appear to be driven by environmental 
factors known to strongly affect the diversity of 
many plant and animal taxa or soil bacteria. 
Furthermore, levels of protistan and bacterial diver- 
sity were only moderately correlated across samples 
(Supplementary Figure S5; Pearson's R = 0A6, 
P= 0.003 for richness and P = 0.38, P= 0.015 for 
Shannon index), indicating that distinct factors may 
influence diversity levels within these individual 
microbial groups. 

Although alpha diversity levels were not strongly 
correlated with environmental factors and commu- 
nity structure was highly heterogeneous across large 
spatial scales, we did observe predictable beta 
diversity patterns for both soil protistan and bacter- 
ial communities. As has been reported in a number 
of other studies (Fierer and Jackson, 2006; Lauber 
et aL, 2009; Griffiths et aL, 2011), we also found 
bacterial beta diversity patterns in soil were strongly 
related to pH (Mantel global P = 0.6-0.7, P<0.01, 
Bray-Curtis or UniFrac for both weighted and 
unweighted matrices). Shifts in community compo- 
sition for soil protists, on the other hand, were most 
strongly correlated with CMI values (Mantel global 
R = 0.5-0.6, P< 0.01, all matrices), which is an index 
of annual soil moisture availability. Soil protistan 
and bacterial community structure also differed 
significantly in ordination space (Supplementary 
Figure. S6; Procrustes Ai^ = 0.52, P>0.05), again 
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Figure 3 Principal coordinates analysis (PCoA) plot on the 
unweighted UniFrac distance matrix generated from rarefied 
taxon abundances and depicting patterns of beta diversity for 
protistan communities of soil. Points that are closer together on 
the ordination have communities that are more similar, permuta- 
tional multivariate analysis of variance indicated that differences 
between communities mapped according to site CMI classes were 
highly significant (P< 0.001). Sites where community composi- 
tion was highly similar despite large geographic separation are 
highlighted with arrows indicating (A) Antarctic Dry Valleys 
(black; EB24 and EB26) and Mojave Desert (gray; MD4 and MD5) 
of California, USA and (B) Peru (black; PE7) and Puerto Rico 
(gray; LQ3). 

suggesting that the factors structuring these micro- 
bial communities are fundamentally distinct. 

The relationship between protistan community 
structure and the general CMI class (arid, semiarid, 
semihumid and humid) was significant for both 
taxonomic- and phylogenetic-based distance mea- 
sures (Bray-Curtis or UniFrac, respectively, for both 
weighted and unweighted matrices; each permuta- 
tional multivariate analysis of variance at P<0.05). 
This trend was most evident when using the 
unweighted UniFrac distance metric (permutational 
multivariate analysis of variance, pseudo-F = 3.64, 
P= 0.001) as shown in Figure 3. As a result of this 
correlation between annual soil moisture availabil- 
ity and community similarity, assemblages of soil 
protists from very distant localities could be highly 
similar. For example, the extremely arid Antarctica 
and Mojave Desert (western United States) sites 
shared relatively similar protistan communities 
(Figure 3a), as did humid sites from Peru and Puerto 
Rico (Figure 3b). 

Within each general CMI class, the relative 
abundance of certain higher-level (for example, 
phylum or class) protistan taxa varied considerably 
(Figure 4). Dinophyceae clearly dominated more 
arid soils, whereas the Ciliophora and Apicomplexa 
were relatively more abundant in soils of humid 
sites. The Cercozoa were widespread, but tended to 
be more abundant in arid soils. The prevalence of 
Cercozoans in soil was expected, as these protists 
are known to be abundant and broadly dispersed in 
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Figure 4 Relative abundance of soil protistan taxa (y axis as 
group percentage of the total number of 18S rRNA gene sequences 
per sample, after rarefaction to correct for uneven sampling effort) 
averaged across CMl classes. 

terrestrial environments. The structure of soil 
cercozoan communities has been shown to vary 
between tropical and temperate sites (Bass et ah, 
2007; Howe et ah, 2009), and climate has also been 
suggested as an important factor in shaping the 
structure of other protistan goups, namely proto- 
steloid amoebae communities (Aguilar and Lado, 
2012). Likewise, the diversity of particular ciliate 
taxa are known to shift according to the biome 
(primarily arid versus mesic types) in which they are 
found (Bamforth, 1995; Chao et al, 2006), whereas 
some ciliates form cysts that allow these organisms 
to survive in arid areas even under drought condi- 
tions (Adl and Gupta, 2006). Our findings are 
therefore congruent with what is known about the 
ecology of some soil protist groups, and highlight 
the importance of higher-level taxa in shaping the 
biogeographical patterns that we observed. 

We also made an unexpected discovery that soils 
harbored relatively large numbers of protistan taxa 
long considered to be absent, or at least rarely 
encountered, in soil. For example, Apicomplexa 
taxa have been known to inhabit soils during 
encysting stages (Ruiz et ah, 1973), yet they have 
not previously been recognized as being abundant in 
soil. We found Apicomplexa (81% of these 
sequences represented the Eucoccidiorida) to be 
abundant in soils from more mesic sites (Figure 4); 
however, the possibility that some taxa recovered in 
this study may only represent resting stages from 
non-indigenous organisms accumulated over time 
cannot be explicitly ruled out by our methods. 
Eucoccidiorida are known as common animal para- 
sites such as Toxoplasma that can also encyst like 
ciliates. Their high abundance in more humid soils 
could be attributed to the fact that oocyst viability is 
likely dependent on adequate soil moisture (Ruiz 
et ah, 1973). Trends in Toxopiasma-related diseases 
follow similar patterns, with incidences being more 
frequent in areas with higher annual rainfall 



(Gomez-Marin et ah, 2011). Soil invertebrates are 
also known to harbor commensal or parasitic 
apicomplexans (Olsen, 1986); however, the numbers 
of these taxa associated with a host versus freely 
distributed throughout the soils sampled here 
cannot be conclusively determined by our methods. 
The Dinophyceae (99% of the sequences repre- 
sented the Heterocapsaceae) were also found to be 
highly abundant in humid soils (Figure 4). Members 
of this class are generally thought to be restricted to 
aquatic habitats; however, we found that they can 
also be abundant in soils. Taken together, these 
results and those reported in other recent molecular 
surveys (Lejzerowicz et ah, 2010; Coolen et ah, 
2011) suggest that protistan taxa long considered to 
be restricted to aquatic environments may also be 
quite common in terrestrial environments. 



Conclusions 

The results of this study, perhaps the most compre- 
hensive assessment of terrestrial protists conducted 
to date, suggests that, in addition to commonly 
known soil groups (for example, Cercozoa and 
Ciliophora), previously unrecognized taxa (for 
example, Apicomplexa and Dinophyceae) are also 
important members of soil protistan communities. 
Future studies using RNA-based approaches will be 
useful in confirming the presence of these pre- 
viously unrecognized taxa as being biologically 
active soil community members. As soil studies 
conducted in recent decades have stressed the 
incredible diversity of soil prokaryotic communities 
(Curtis et ah, 2002), we found that bacteria are not 
unique in this respect, as their diversity is similar in 
magnitude to that of soil protists. Protistan taxa were 
generally not cosmopolitan across the soils sampled 
here, and like their bacterial counterparts, soil 
protists exhibit predictable biogeographical pat- 
terns. Whereas pH has frequently been identified 
as the dominant factor driving global patterns of 
bacterial biogeography (Fierer and Jackson, 2006; 
Lauber et al, 2009; Griffiths et al, 2011), protistan 
biogeography is best predicted by climatic condi- 
tions that regulate annual moisture availability in 
soils at comparable scales of inquiry. Despite debate 
regarding the existence of biogeographic patterns for 
microorganisms (Green and Bohannan, 2006; 
Martiny et al, 2006), our survey adds to the growing 
body of literature suggesting that soil microbes, even 
those with the potential for sustained global dis- 
persal, have distinct biogeographies. 
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